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AFLP-based method for integrating physical and genetic maps 

Field of the invention 
The present invention pertains to the field of biotechnology, more in particular to 
the field of genomic mapping and more in particular to the field of linking physical and 
genetic markers and to the field of integrating physical and genetic maps. The invention 
pertains to a method for the integration of physical and genetic m^s, more in particular to 
the high throughput building of integrated genetic and physical maps. 



Background of the invention 
Integrated genetic and physical genome m^s are extremely valuable for map-based 
gene isolation, comparative genome analysis and as sources of sequence-ready clones for 
genome sequencing projects. The effect of the availability of an integrated inap of physical . 
and genetic markers of a species for genome research is enormous. Integrated maps allow 
for precise and rapid gene mapping and m^ing of all microsatellite loci and other 
^pUcation such as SNA marker based gene manipulation. Various methods have been 
developed for assembling physical maps of genomes of varying complexity. One of the 
better characterised approaches use restriction enzymes to generate large numbers of DNA 
firagments from genomic subclones (Brenner et al, Proc. Natl. Acad. Sci., (1989), 86, 
8902-8906; Gregory et al. Genome Res. (1997), 7, 1162-1 168; Marra et al., GeLomQ Res. 
(1997), 7, 1072-1084). These fingerprints are compared to identify related clones and to • 
assemble overlappmg clones m contigs. The utility of fingerprinting for ordering a 
complex genome is limited, however, due to variation in DNA migration from gel to gel, 
the presence of rq)etitive DNA sequences, unusual distribution of restriction sites and 
skewed clone representation. Moreover, fingerprinting alone, unless combined with other 
methods, does not link genomic clones directly to genetic maps. Therefor most high quality 
physical maps of complex genomes have been constructed usmg a combination of 
fingerprinting and PCR-based or hybridisation based methods. 

Selective restriction firagmeifl anq>lification or AFLP is known, for instance from 
the European patent appUcation 0 534 858 and US 6,045,994 by appUcaat and from an 
article by Vos et al Nucleic Acids Research (1995), 23, 4407-4414, incorporated herein by 
reference. In general, AFLP comprises the steps of: 

(a) digesting a nucleic acid, in particular a DNA or a cDNA, with one or more specific . 
restriction endonucleases, to fi:agment said DNA into a corresponding series of 
restriction fragments; 
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(b) ligating the restriction fragments thus obtained with at least one double-stranded 
synthetic oligonucleotide adapter, one end of which is compatible with one or both 
of the ends of the restriction fragments, to ther^y produce tagged restriction 
fragments of the starting DNA; 
. (c) contacting said tagged restriction fragments under hybridising conditions with at 
least one oligonucleotide primer; 

(d) amplifying said tagged restriction fragments hybridised with said primers by PGR 
or a similar technique so as to cause ftirther elongation of the hybridised primers 
along the restriction fragments of die starting DNA to which said primers 
hybridised; and 

(e) identifying or recovering the amplified or elongated DNA fragment thus obtained. 
The amplified DNA-fi»gments flius obtained can then be analysed and/or 

visualise4 for instance by means of gel-electrophoresis. This provides a genetic fingerprint 
showing specific bands corresponding to the restriction fragments which have been linked 
to the adapter, have been recognised by the primer, and thus have been amplified during 
the amplification step. The fingerprint thus obtained provides infomiation on the specific 
restriction site pattern of the starting DNA, and thus on the genetic make-up of ttie 
organism from which said DNA has been derived. 

AFLP can therefore be used to identify said DNA; to analyse it for the presence of 
specific restriction site patterns, restriction fragment length polymorphisms (RFLPs) and/or 
specific genetic markers (so-called "AFLP-markers"), which may be indicative of the 
presence of certain genes or genetic traits; or for similar purposes, for instance by 
comparing the results obtained to DNA-samples of known origin or restriction pattern, or 
data thereon. AFLP is eminentiy suited to characterise genetic markers by means of one or 
more of the AFLP fi:agment8 thus visualised. 

The primers used in AFLP are such that they recognise the adapter and can serve as 
a starting point for the polymerase chain reaction. To this end, the primers must have a 
nucleotide sequence that can hybridise with (at least part of) the nucleotide sequence of the 
ad^ter adjacent to the 3' end of the restriction fragment to be amplified. The primers can 
also contain one or more further bases (called "selective bases") at the 3'-end of their 
sequence, for hybridisation with any conq)lementary base or bases at flie 3'-end of the 
ad^ter Ugated restriction fragment. Located between the part of the primer that hybridises 
to the adapter and the selective bases that hybridise to the restriction fragment, the primer 
may contain a section that is capable of hybridising to the remains of the restriction site. 
Thus, in general an AFLP primer has the following stricture: adapter complementary part- 
restriction site remains complementary part-selective bases. The adapter complementary 
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part- restriction site remains complementazy part is generally depicted as the * constant, 
sequence* of the AFLP primer and the selective bases as the 'variable sequence'. 

As, of all the adapter-ligated restriction fragments present in the mixture^ only those 
fragments that contain bases complementary to the selective bases will subsequently be 
5 amplified, the use of these "selective" primers will reduce the total amoimt of bands in the 
final fingerprint, thus making the fingerprint more clear and more specific. Also, the use of 
different selective primers (i.e. different variable sequence) will g©tierally provide different 
fingerprints, which can also be used as a tool for the purposes of identification or analysis. 
The selective nucleotides are complementary to the nucleotides in the adapter- 

10 ligated restriction fragments that are located adjacent to the constant.primer sequence. 
Primers containing selective nucleotide are denoted as +N primers, in which N 
stands for the number of selective nucleotides present at the 3 '-end of fhe primer. N is . 
preferably selected fi^om amongst A, C, T or G. 

N may also be selected &om amongst various nucleotide alternatives, Le. 

1 5 compounds fliat are capable of nmnicking the behaviour of ACTG-nuQleotides but in 

addition thereto have other characteristics such as the capability of improved hybridisation 
compared to the ACTG-nucleotides or the capability to modify the stability of the duplex 
resulting from the hybridisation. Examples thereof are peptide nucleic acids (PNAs), 
locked nucleic acids (LNAs), inosine etc. When fhe amplification is performed with more 

20 than one primer, such as with PGR using two primers, one or both primers can be equipped 
with selective nucleotides. The number of selective nucleotides may vary, depending on 
the species or on other particulars determinable by the skilled man. In general the number 
of selective nucleotides is not more than 10, but at least 5, preferably 4, more preferably 3, 
most preferred 2 and especially prefmed is 1 selective nucleotide. 

25 A +1 primer thus contains one selective nucleotide, a +2 primer contains 2 selective 

nucleotides etc. A primer with no selective nucleotides (i.e. a conv^tional primer) can be 
depicted as a +0 primer (no selective nucleotides added). When a specific selective 
nucleotide is added, this is depicted by the notion +A or +C etc. 

By amplifying a set adapter ligated restriction fragments with a selective primer, a 

30 subset of adapter-ligated restriction fragments is obtained, provided that the 

complementary base is present at the appropriate position in the restriction firagment Using 
a +1 primer, for example, the complexity (and the nmnber of visualised fragmrats) of the 
amplified mixture is reduced by a factor 4 compared to a amplification with a non-selective 
primer (a +0 primer) Higher reductions can be achieved by using primers with multiple 

35 selective nucleotides, i.e. 16 fold reduction of the original multiplex ration is obtained with 
2 selective nucleotides etc. 
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As AFLP provides amplification of both strands of a double stranded starting 
DNA, AFLP advantageously allows for exponaitial amplification of the fragment, i e 
according to the series 2, 4, 8, 16, etc.. Also, AFLP requires no prior knowledge of the 
DNA sequence to be analysed, nor prior identification of suitable probes and/or the 
5 construction of a gene library fi-om the starting DNA. 

For a further description of AFLP, its advantages, its embodiments, as weU as the 
techniques, enzymes, adapters, primers and fiirther compounds and tools used therein, 
reference is made to EP-0 534 858, and to Vos et al Nucleic Acids Reseaix^h (1995). 23 
4407-4414 both pubhcations are incorporated herein by reference. Also, in the description 
10^ herembelow, the definitions given in paragraph 5.1 of EP-0 534 858 will be used, unless 
indicated otterwise. 

The potential of AFLP as a technology for the integration of physical and genetic 
maps has been recognised before. Klein et al in Genome Research, (2000), 10, 798-807 
have described the use of AFLP in the integration of physical and genetic m^s of 
Sorghum. The method of Klein et al. conqaises generating AFLP fingerprints using +3/+3 
selective primers of aU individual BAC clones in a Hhrary. The mefliod fiirther comprises 
the generation of pools of clones. The pools are also analysed by AFLP fingerprinting 
albeit using different restiiction enzymes and otherwise oflier circumstances compared to 
tiie generation of the AFLP fingeiprints of all the individual BACs. The use of different 
enzyme combmations renders the method difficult. The method of Klein et al although 
feasible, is complex and laborious and comes with several other disadvantages One of 
them IS that tiie method is unsuitable for positioning non-polymoiphic maricers on the 
integrated map. 
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It is a goal of the present invention to provide for an improved metiiod for the 
mtegration of physical and genetic maps. It is a further goal of the present invention to 
provide for an improved method based on AFLP. It is yet a further goal of the invention to 
provide for a high throughput method for the integration of physical and genetic maps that 
results m m^s of improved quality such as measured by increased marker densities or as a 
result of more reliable contig genwation. 

Description of tiie invention 
The present invention pertains to mefliods for the integration of physical and 
genetic markers and for flie integration of physical and genetic m^s. The pr^ent invention 
overcomes many of the previously encountered problems in tiie art and provides for 
sigmficant improvements and advantages tiiat will become clear flirou^ this description 
tiie clamis and tiie examples. The mefliod is based on AFLP and more in particular based 
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on the use of combinations of selective primers in AETJP. The method results in the 
integration of genetic and physical maps and the linking of physical and genetic markers. 
The invention further pertains to the use of AFLP primers in the methods of the invention 
and to a set of structurally deJGned AFLP primers. 

Detailed description of the invention 
In one aspect the invention pertains to a method for the integration of physical and 
genetic maps by associating a restriction fragment, preferably an AFLP fragment with a 
genetic marker, providing a Ubrary of clones containing inserts of the (part of) genome of 
interest, pooling the clones into pools and fingerprinting the pools using AFLP primer- 
pairs of higher selectivity, individually fingerprinting the clones in a library using AFLP 
primer-pairs of lower selectivity. The pools that contain a fragment corresponding to ,the 

fi»gment associated with the genetic marker are identified and the individual clones 
containmg the firagment corresponding to the firagment associated with the genetic markw 
are identified and linked. From the fingerprint patterns of the individual clones containing 
the fragment associated with the genetic marker a contig is generated. The contig is linked 
to the genetic marker on the genetic map and the process is repeated for all genetic, 
markers, thereby providing an integrated physical and genetic map. 

The notion 'primer-pairs of higher/lower selectivity' as used herein relates .to the 
number of selective nucleotides in the AFLP primers used for fingerprinting. Thus a +0/+0 . 
primer-pair is of a lower selectivity than a +0/+1 primer-pair or a +1/+0 printer-pair. 
Similarly a +3/44 primer-pair is of a higher selectivity than a +2/+2 primer-pair etc. 

In one aspect the invention pertains to a method for linking a genetic and a physical 
g^ome map comprising the steps of: . 

(a) providing individual genetic markers, preferably in the fonn of a genetic 
m^; 

(b) characterising each of the genetic markers by means of at least one AFLP 
fi-agment identified through AFLP fingerprinting; 

(c) providing an artificial chromosome library such as a BAG or YAC; 

(d) goierating a multitude of pools, each pool containing a multitude of 
artificial chromosomes from the library; 

(e) generating an AFLP fingerprint for each of the pools; 

(f) selecting firom the multitude of pools the individual pools in which the at 
least one AFLP firagment associated with a genetic marker is present in the 
fingerprint of the individual pool; 

(g) generating an AFLP fingerprint of the individual artificial chromosomes 
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from the pools identified imder (f); 

(h) selecting from the fingerprints of the individual artificial chromosomes of 
step (g) the individual artificial chromosome(s) in which the AFLP 
fragment associated with the genetic marker is present in the fing^rint; 

(i) generating a contig of the individual artificial chromosomes identified in 

step (h); 

Q) linking the contig obtained in step (i) to the genetic marker on the genetic 
map, thereby obtaining a linkage between the physical marker and the 
genetic marker; 

(k) repeating steps (g-j) for all genetic markers and align the contigs obtained to 
thereby obtain an integrated physical and genetic map; 

wherein the forward and reverse AFLP primers used in step (b) and (j^ comprise K 
respectively L selective nucleotides at the 3'-end of the primer, wherein the forward and 
reverse AFLP primers used in step (d) comprise M respectively N selective nucleotides at 
the 3*-end of the primer, wherein K, L, M, N are integers &om 0 to 10, and wherein K+L > 
M+N. 

The method of the invention is flexible. Flexibility in this respect refers to the 
starting material. The method of the invention may start from whole genomes of any 
species, but is also applicable to only a part of the genome or a selected regionof the 
genome or a chromosome or part thereof. 

The method starts by using individual genetic markers. It is preferred that these 
markers have been integrated on a genetic map, but this is not a pre-requisite. The method 
of the inventions is equally well suited to link physical markers to genetic markers, after 
which the physical markers are aligned to from a physical map and at the same time 
provide a genetic map. The genetic map may comprise only one single genetic marker but 
preferably the map comprises two or more genetic markers. Each of the genetic markers is 
characterised by an AFLP fragment or by a combination of AFLP fragm^ts. The second 
step is the provision of a library of artificial chromosomes. The library can be a Bacterial 
Artificial Chromosome Ubraiy (BAG) or based on Yeast Artificial Chromosome (YAC). 
Other libraries such as based on Cosmids, PAC, TAC or MAC are also possible. Preferred 
is a BAC library. The library is preferably of a high quality and preferably is a high insert 
size genomic library. This means that the individual BAC contains a large insert of the 
genomic DNA under investigation. The size of the preferred large insert is species- 
dependent Throughout this application reference is made to BACs as examples of artificial 
chromosomes. However, it is noted that the present invention is not limited thereto and that 
other artificial chromosomes can be used without departing from the gist of the invention. 
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Preferably the libraries contain at least five genome equivalents^ more preferably at least 7, 
most preferably at least 8. Particularly preferred is at least 1 0. The higher the number of 
genome equivalents in the library the more reliable is the resulting ,contig and flie 
integrated maps. 

5 The individual clones in the library are pooled to form pools containing a multitude 

of artificial chromosomes or clones. The pooling may be the simple combination of a 
number of individual clones into one sample (for example, 100 clones into 10 pools of 10 
clones), but also more elaborate pooling strategies may be used. The distribution of the 
clones over the pools is preferably such that each clone is present in at least two or more of 
10 the pools. The pools are generated based on pooling strategies well known in the art. The 
skilled man is capable selecting the optimal pooling strategy based on factors such as 
genome size etc. The resulting pooling strategy will depend on the circumstances, and 
examples thereof are plate pooling N-dimensional pooling such as 3D-pooling, 6D-pooling 
or conqplex pooling. 

1 5 Each of the pools is fingerprinted using AFLP and preferably with primer-pairs 

having a high selectivity to thereby generate fingerprints that contain a limited number of 
bands or fi-agments. Preferably the primer-pairs used in the fingerprinting of the pools are 
the same primer-pairs that are xised to identify the AFLP firagment characteristic of the 
genetic markers in step (b). The primer-pairs with a high selectivity contain more selectiye 

20 nucleotides than the primer with a low selectivity and are for instance +3/+3 primer-pairs 
or +2/+3 primer-pairs. 

The fingerprints of the pools are subsequently screened for the presence or absence . 
of the firagments of interest that are associated with the genetic marker(s). From a pool (or 
pools) in which the desu:ed firagment has been foimd present, each of the individual clones 

25 in that pool (or pools) is fingerprinted using AFLP. The individual clones are preferably 
fingerprinted with primer-pairs having a low selectivity to thereby generate fingerprints 
that contain a large number of bands or firagments that later aid in the generation of reliable 
contigs. In general in this method, the AFLP fingerprints of the genetic markers, the pools 
and the individual clones are preferably obtained using the same conibiinations of 

30 enzyme(s) and adapters. Preferably only the primers are di£Eerent and preferably only the 
selective bases are different in number and/or in type. The primer-pairs with a low 
selectivity are for instance +0/-K) primer-pairs or +1/-H) primer-pairs. 

The fingerprints of the individual clones that have been obtained are now subjected 
to a selection step whereby those firagments are selected that are most likely coimected to 

35 the genetic marker of interest. This is achieved by subjecting the individual clones 
containing the desired firagment to a selection step, preferably in silico. 
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In this selection step, the fingerprinting patterns of the individual clones are 
compared. When fingerprinting patterns are compared that are derived from clones with 
overlapping inserts, the respective fingerprints will contain one or more bands/fragments 
that are similar in place, intensity, type etc. (See for example Fig 1., the bands are marked 
as #,'^ $). With fingerprint patterns that are derived from clones that do not have 
overlapping inserts the chance that one or more bands express such a similarity is 
significantly lower. When, based on the fingerprint patterns, it is likely that the associated 
inserts of the clones are overlappmg inserts, aUgnment in silico results in the generation of 
a contig. This contig is then linked to the genetic marker and the genetic marker can be 
positioned on the genetic map. Repeating these steps for various genetic markers ultimately 
fesuHs in positioiiing of all coiitigs on the geiietic map and in the integration of the physical 
and genetic map. 

The advantage of the fingerprinting of the clones using primers with low selectivity 
is that many fiagments are ampUfied. Ctonsequently, a large number of physical markers 
are identified. TypicaUy 4-5 times more scorable fragments are generated with a +0/+) 
ampUfication compared to +2/+3 amplification and leads consequently to a 4 or 5 times 
higher resolution of the contigged clones. In this way, a high-iesohition physical m^ is 
generated. This is advantageously in the generation of contigs and it will reduce the 
number of singletons that may occur in the generation of contigs. 

The term 'contig' as used herein refers to a continuous sequence of DNA that has 
been assembled fix>m overlappmg cloned DNA fragments, foUowmg the definition 
provided in 27te Encyclopedia of Molecular Biology (1994, Blackwell). 

In general, the primers of low and high selectivity are characterised in that the 
forward and reverse AFLP primers of high selectivity used in the fmgeiprinting of the 
individual clones and m the identification of the AFLP fragment associated with the 
genetic marker comprise K respectively L selective nucleotides at the 3'-end of the primer, 
wherein the forward and reverse AFLP primers used in the fmgerprinting of the mdividual ' 
clones fixMn the library comprise M respectively N selective nucleotides at the 3 '-end of die 
primer, wherem K, L, M, N are integers fiwm 0 to 10, and wherein K+L > M+N. Thus the 
primers of high selectivity contain at least one selective nucleotide more than the primers 
of low selectivity. 

Ahgnment is performed by comparing band patterns and intensities of the 
fingerprint to identify overlappmg patterns. Clones contaming overlapping patterns overly 
can be aligned based thereon. Due to the use of AFLP with low selective primers, the 
physical markers present in the clones are abundant and hence the aUgmnent thereof can be 
performed with great accuracy. The contig can be generated using any means known in the 
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art. The clones can be aligned iising software well known in the art for these purposes such 
as FPC (Soderlund C, I. Longden, R. Mott, 1997, FPC: a system for building contigs &om 
restriction fingerprinted clones. Comput. Applic. Biosci. 13:523-535). This software can be 
used in the method of the present invention with cut-off parameters that are between 10"^ - 
10"^^ , preferably between 10"^ - lO'^"* , more preferably between 10"^ - 10"^^ , ia particular 
between 10'^ - 10"'^ , more preferably between 10"^ - 10"^^ , and most preferred 10"'^. Cut-off 
is a parameter that that represents the threshold value representing the maximum allowable 
probabiUty of a chance match between any two clones. Tolerance is a parameter that is a 
measure for the maximum distance or bandwidth which two bands from two different 
clones can differ and still be considered the same band. In this invention^ tolerance is rated 
from 0 (identical) to 5 (difference of 0:5 base pair)- Rreferably, tolerance is as low as 
possible (i.e. 0), but in practice a value of 3-4 (difference of 0.3-0.4 base pair) is acceptable 
and 1-2 is preferred. 

The gCTierated contigs are then linked to the genetic marker on the. genetic map and 
the whole process is repeated for the other genetic markers of the genetic map thereby 
integrating the genetic map with the physical map. 

The method according to the invention has certain advantages over methods > 
disclosed in the art. One of the advantages is that the resulting integrated map has high 
resolution due to the fingerprinting of the individual clones with primer-pairs of low 
selectivity. The amplification step of AFLP with low selective primers will result in the. i 
amplification of more adapter-ligated restriction fragments and consequently to more . ; 
-physical markers per clone as compared to similar fingerprinting with primers of high 
selectivity. Due to this increased number of physical markers perxlone the subsequent 
generation of a contig is also of a better quality and more clones can be correctly aligned . 
resulting in an improved integration of the physical and genetic m^. 

Another advantage is in ttie speed of the generation of the contig. With the method 
of the present invention relative small sets of individual clones are selected tiiat are 
associated with the AFLP firagment of the genetic marker of which a contig is generated. 
Small sets have the advantage that the generation of a contig is significantly faster than 
when larger sets are used. After placement of the contigs in the genetic m^^ a more 
reliable integrated map is obtained by aligning the placed contigs and verifying the 
aligmnents with the position of the genetic marker on the genetic map. One of the fiirther 
advantages is that the relative genetic distance (generally in centimorgan) are now 
converted to physical distances (in basepairs) and may provide a correction or 
enhancement of the genetic map, for instance by reversing the order of genetic markers or 
by more accurately determining the distance between markers. One of the most distinct 
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advantages is that each step is verifiable and correctable. This means that in case an 
amplification or a detection, for instance on a gel, is not of the desired quality, this step can 
be immediately corrected until data of the desired quality are obtained. 

A significant advantage of the method of the invention over the methods known in 
the art is that the present method provides for the detection of non-polymorphic markers 
and their subsequent integration in the integrated map. Another advantage associated with 
the method of the invention is that the generation of physical mapping data and linkage to 
the genetic map are achieved directly from the start of the method (for each marker 
separately). 

In one embodiment the method can also be used for the linking or integrating of a 
physical" and genetic marker. The method then comprises tiie steps of: 

(a) characterising the genetic marker by means of at least one AFLP fragment 
identified flirough AFLP fingerprinting; 

(b) providing an artificial chromosome library such as a BAG or YAC; 

(c) generating a multitude of pools, each pool containing a multitude of artificial 
chromosomes from the library; 

(d) generating an AFLP fingerprint for each of the pools; 

(f) selecting from the multitude of pools those individual pools in which the 
AFLP fragment associated with the genetic marker is present in flie 
fingerprint of the individual pool; 

(g) generating an AFLP fingerprint of the individual artificial chromosomes from 
the pools identified xmder (f); 

(h) selecting from the fingerprints of the individual artificial chromosomes of 
step (g) the individual artificial chromosomes in which the AFLP fragment 
associated with the genetic marker is present in the fingerprint; 

(i) generating a contig of the individual artificial chromosomes identified in step 

(j) linking the contig obtained in step (i) to the genetic marker on the genetic 
map thereby obtaining a Knkage between the physical marker and the genetic 
marker; 

wherein the forward and reverse AFLP primers used m step (b) and (f) comprise K 
respectively L selective nucleotides at the 3*-end of the primer, wherein the forward and 
reverse AFLP primers used in step d comprise M respectively N selective nucleotides at 
the 3 ' -end of the primer, wherein K, L, M, N are integers from 0 to 1 0, and wherein K+L > 
M-hN, 
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In this embodiment the present invention is used for the single integration of a 
genetic marker with a (group of) physical marker(s). By repeating these steps for all 
available genetic markers, the complete integrated map can be constructed, but in this 
embodiment this is entirely optional, 
5 In one embodiment, the diflFerence in the number oJf nucleotides that are used in the 

set of primers with low selectivity compared to the set of primers of high selectivity, also 
depicted as (K+L) - (M+N), is at least 1, preferably at least 2, more, preferably at least 3, 
most preferably at least 4, In one embodiment, the number of selective nucleotides used in . 
the set of primers of low selectivity, also depicted as is M+N, is at least 0, preferably at 
10 least 1, more preferably at least 2, most preferably at least 3. 

The pools of the individual clones that are used in the method of the present 
invention preferably contain between 0.001 and 1 genome equivalent . of the total genome 
to be analysed, preferably between 0.1 and 0.75, more preferably between 0.15 and 0.60, 
most preferably between 0.20 and 0.50. More preferred is between 0.25 and 0.35, more 
15 preferably 0,4, most preferably 0.3. 

In general, a pool of individual clones is limited in the number of clones it can 
contain. In case of exceptionally large genomes, libraries or very small inserts, it may be 
advantageously to introduce and additional pooling step. Therefore in a preferred . 
embodiment of the invention, the method comprises an additional pooling step. The 
20 additional pooling step results in pools that can be fingerprinted using primers with the 

same (Wgh) selectivity as are used for the fingerprinting of the pools of the initio . 
step. It is also possible to use primers of a intermediate selectivity, that is, primers, with a 
number of selective nucleotides (P+Q) between K+L and M+N such that K+L > P+Q > 
M+N. 

25 la an altraoative embodiment, it is possible to avoid the use of pools of clones 

altogether by generating a contig of all individual clones that have been fingerprinted using 
primers with low selectivity. Optionally a pre-selection of the individual clones is made 
wherem the clones at least contam the firagment that is associated with the genetic marker. 
More in detail: the individual fingerprints obtained using primers of low selectivity are 

30 screened for the presence of the (fiagment) marker associated with flie genetic markw that 
has been identified using the primers with high selectivity. This embodiment is 
advantageously for instance when smaller genomes are investigated or when only a few 
genetic markers are integrated with the physical markers. In such cases there is no real need 
for a pooling step. 

35 In one embodiment to characterise all genetic markers available by AFLP it may be 

preferred to use different primers and/or primer combinations for different genetic markers. 
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The primers and/or primer combinations for the fingerprinting of the pools and the 
individual clones can then to be ad^ted accordingly. It is even possible to use different 
enzyme combinations for different genetic markers. The associated contigs of BACs can 
be aligned based on the overlap of the BAG contigs and the occmrence of identical 
. physical markers. This is schematically depicted in Fig 3. 

The method of the invention is in principle suitable for the linkage of one or more 
physical markers to a genetic marker and for the linkage of physical markers to the genetic 
map comprising a multitude of genetic markers. It is preferred that the genetic markers are 
spread evenly across the map. It is further preferred that the genetic m^ is a map that is 
. !?.?!® as a genetic m^ with a high marker density. This faciUtates the linkage 

of (the conti^of) the physical itnarkera to the genetic map and enhances the quahty and 
rehability of the resulting mtegrated physical and genetic m^. Preferably the genetic m^ 
has an (average) density of at least 1 genetic maiker per 1000 kb, preferably per 500 kb 
more preferably per 200 kb, most preferably per 100 kb. Particularly preferred is a genetic 
m^ with a density of at least 1 genetic marker per 50 kb. 

When all genetic markers are linked to a contig draived from the individual clones 
in the library, g^s may remain and/or clones may remain that have not been placed on the 
map. Due to the increased density of the physical markers obtained by the fingerprinting of 
the individual clones with primers of low selectivity, the placement of these clones can be 
accomplished using conventional aligmnent software, and using techniques such as BAG 
walking and using the BAG end sequences. 

Li another aspect, the present invention pertains to the use of AFLP primers for the 
integration of genetic and physical maps and to the use of AFLP and AFLP pruners in 
linking genetic and physical genome maps. AFLP and AFLP primers have demonstrated to 
provide an exceUent technique for the integration of the physical and genetic maps. 

In a fimher aspect the invention pertains to the use of a first and a second pair of 
AFLP primers in a method for linkmg genetic and physical genome maps wherein the first 
pair of AFLP primers comprise K respectively L selective nucleotides at the 3'-end of the 
primer, wherein the second pair of AFLP primers used comprise M respectively N 
selective nucleotides at the 3'-end of the primer, wherein K, L, M, N are integers from 0 to 
10, and wherein K+L > M+N. 

By linking the physical contigs to the geaetic markers a first integrated m^ can be 
obtained. Based thereon fiirther unprovement, correction and refinement of the genetic map 
is possible by generating a contig of the various physical markers linked to a genetic 
marker. 
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The invention provides for the construction of high resolution Integrated genetic 
and physical map of any genome without the need for prior knowledge of the sequence. 
The invention furlher facilitates the efficient construction of detailed physical maps of, 
optionally pre-selected, regions of any genome. The invention ftirther facilitates the 
discovery of novel BAG derived (AFLP) markers, both polymorphic and non-polymorphic 
for the integrated genetic map. 



Description of the Figures 
Figure 1 discloses a schematic rqwesentation of the integration of a physical and a genetic 
map. The method starts by identifying genetic markers using AFLP fingerprinting. In this 
case a genetic marker was identified using Hindm and Msel as restriction endonucleases in 
the AFLP fingerprinting and Hind+2 and Mse+3 high selective primers carrying AT and 
CCA as selective nucleotides at their 3' end, respectively. The identified AFLP fiagment 
associated with the genetic marker has a length of 344 bp. A BAClibtary is prepared 
comprising Z clones. The individual clones are pooled (pool I contains clones 1, 2 and 3; H 
contains 3, 5 and Z; and m contains 1,4. and 5) and the pools are fingerprinted using AFLP 
with identical enzymes and primers with the same selectivity as used in the identification 
of the genetic marker (Hind+2(AT) and Mse+3(CCA)). This positively identifies those 
pools that contain the fragment AT/CCA-344 (pools I and ID). From each clone present in 
the pool Qjools I and m) that contains the fi-agment of mterest, (1, 2, 3, 4, 5) an AFLP 
fingerprint is generated, using primers of lower selectivity, in this case +0/+0. Each of the 
individual clones is fingerprinted and bands with 344 bp length can be identified ( in 2, 3, 
4, 5). These bands may very well be unrelated to the genetic marker of interest, as they ako 
include any fiagment that contains other nucleotides than the AT/CCA combination at the 
respective ends. From the individual fingerprints containing the 344 Augment the clones 
are selected in silico that are most likely to be linked to the AFLP fragment of interest. 
This selection stqp is based on the comparison of one or more bands in the fingerprint 
patterns. The more corresponding bands are found in the fingeiprints (the bands are 
depicted as #,'^ $), the higher the likelihood that these fingerprints are able to form a contig 
and thus aUgn the individual clones &om the library to form a contig. The individual clones 
that contain a fi-agment with a length of 344 bp and that are selected based on,the in sOico . 
process of the fingerprint patterns are aligned using FPC to form a contig. This contig can 
be placed on the genetic map. False positive firagments (i.e. that contain the fiagment 
AT/CCA-344, but are not linked to the genetic marker of interest) wiU not form a contig 
under the conditions used. Repeating these steps for other genetic markers results in the 
formation of an integrated physical and genetic map. 
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Figure 2: A: Hindin+2/MseI+3 fingerprints of BAG pools of Pristionchus padficus with 
the previously identified AFLP firagmaits characteristic of the genetic maricers indicated by 
arrows. 

B: /Trndin+O/Myel+O fingerprints of individual BACs of corresponding positive pools in 
Fig 2A.and indication of the characteristic AFLP bands (BACl and BAC2). 
C: Corresponding iSndin+O/Myel+O BAG contig and identification of the BAGl md 
BAC2 clones. 

Figure 3: Schematic representation of the method for the integration of physical and 
genetic maps, whereby for different genetic markers the linkage between the physical BAG 
clones and the respective genetic markers is obtained by fingerprinting the BAG pools and 
the individW finjgerprints with different restriction enz^ 

fingerprinting for the different genetic markers. A, B and C are genetic markets. REl and 
RE2 are dijBFerent restriction enzyme combinations (for instance Hindm/Msel and 
EcoRI/Msel). 1-8 are BAG clones. 

A genetic marker A has been linked to physical BAG clones 1-4 using enzyme 
combination REl in the AFLP fingerprinting of the donorparents, the BAG pools and the 
individual BAGs according to the method of the invention. Genetic marker O has been 
mapped to BG clones 6, 9 and 10 using the same enzyme combination REl. Genetic 
markers T and G have similarly been mapped to BAG clones 2,5, 8 and 1,6,7 respectively, 
using enzyme combination RE2. Based on the common presence of BAG clones 1 and 2 in 
A and in B and G respectively, the genetic marker A identified with a different enzyme 
combination than genetic markers B and C can now be placed on the common integrated 
m^. Based on the common presence of BAG clone 6 in O and G respectively, genetic 
marker T can be located adjacent to genetic marker G on the integrated map, thereby 
providing for an integrated map of genetic markers T, A, G and O tiiat have been identified 
using different AFLP enzyme combinations. 



Examples 

BAG library constmction 

High molecular weight DNA of the nematode Pristionchus pacificus was used to 
constiiict a BAG library. PartiaUy HinSm. digested DNA of Pristionchus pacificus was 
sq)arated with pulsed field gel electrophoresis and ligated into the pIndigoBAG-536 
vector. The ligated products were electroporated into electrocompetent DHIOB E. coli cells 
and die average insert size was deteraiined on field inversion gel electrophoresis. The 
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Pristionchus pacificus BAG libraiy contains 13.500 clones with an all over average insert 
size of >125 kb. The estimated goiome coverage of the library is estimated to be at least 7 
X, assuming a 200 Mb total genome size of Pristionchus pacificus. 

5 The poolmg of the B AC library 

The pooling of the BAG library was perfomied by pooling individual 96-well . 
plates. Each 96-well plate was inoculated on a standard TY-agarplate with 
chloroamphenicol selection (12.5 mg/ltr), and incubated ovemight at 37 Gelsius. In the, 
morning the 96 colonies were resuspended in 1 ml of liquid TY mediimi and collected 
1 0 from the agarplates. A standard miniprep procedure was performed and the pooled BAG 
DNAwasiisedforAFLP template prep 

DNA preparation of individual BACs and BAG pools 

DNA from individual B AGs was prepared according to standard procedures 
15 (Sambrook, J., Fritsch, B,F., and Maniatis, T., in Molecular Gloning: A Laboratory 

Manual. Gold Spring Harbor Laboratory Press, NY, Vol, 1, 2, 3 (1989).) An ovemight L5 
ml culture in TY medium and chloroamphenicol (12.5 mg/ltr) selection generates sufficient 
DNA for AFLP template preparation. 

20 AFLP analysis of individual BACs and BAG pools 

AFLP templates ofindividual BAG DNA, the BAG pools and the two donorparents 
that were used in the genetic mapping population were prepared according to standard 
procedures (Vos et al Nucleic Acids Research (1995), 23, 4407-4414 ) using /fmdlll and 
ikCfel as the enzyme combination. 

25 

For the +0/+0 amplification of the individual BAG fi:agments, no preamplification 
step was necessary. The -H)/+0 amplification step was directly perfomied on the 1 : 10 
diluted template of the individual BAG DNA. The amplified +0/-K) firagments were 
detected and sized on a MegaBAGE 1000 capillaiy electrophoresis platform. 

30 AFLP reactions on the BAG pools were preceded by +1/+1 preamplifications with 

one selective nucleotide on each primer. AFLP reactions were performed with primer 
combinations having 2 selective nucleotides on the Hindm primer and 3 selective 
nucleotides on the Msel primer. 

The amplified +2/+3 AFLP firagments were detected and sized on the same 

35 MegaBAGE 1000 capillary electrophoresis platform as the +0/+0 fingerprints. 
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AFLP adaptor primers (S*-3*) 

The following primers were used to generate the fingaprints of the BAG clones 



91M35 : ^/«din-adaptor primer forward (5'-3') 
91M37 : //indin-adaptor primer reverse (5'-3') 
92A18 r Mj-el-adaptor primer forward (5'-3') 
92A19 : Myel-adaptor primer reverse (5 '-3') 



CTCGTAGACTGCGTACC 
AGCTGGTACGCAGTCTAC 
GACGATGAGTCCTGAG 
TACTCAGGACTCAT 



10 



15 



20 



Non selective primer sequences (5'-3') 
93Q34 : Hindm+O primer (5 '-3') 
93E40 : Msel+0 primer (5'-3 ') 



: GACTGCGTACCAGCTT 
: GATGAGTCCTGAGTAA 



AFLP selective primer sequences (5'-3^) 

Hmdm+2 primers can be labeled with FAM, JOE or NED for detection on MegaBACE 
platform. 

Hindm +2 primers (5*-3') ; 

Constant sequence: GACTGCGTACCAGCTT 
Selective nucleotides: 



Msel +3 primers (5 '-3') : 



AA 


CA 


CiA 


LA 


AC 


CC 


GC 


TC 


AG 


CG 


GG 


TG 


AT 


CT 


GT 


TV 



Constant sequence: GATGAGTCCTGAGTAA 
Selective nucleotides: 



AAA 


lAA 


CAA 


GAA 


AGA 


TGA 


CGA 


GGA 


AAC 


TAC 


CAC 


GAC 


AGC 


'i'GC 


CGC 


GGC 


AAT 


TAT 


UAl 


GAT 


AGT 


'I'GT 


CGT 


GGT 


AAG 


TAG 


CAG 


GAG 


AGG 


I'GG 


CGG 


GGG 


ACA 


TCA 


CCA 


GCA 


ATA 


'ITA 


CTA 


GTA 


ACC 


TCC 


CCC 


GCC 


ATC 


Tl'C 


CTC 


GTC 


ACT 


TCT 


CCT 


GCT 


A'lT 


TIT 


CIT 


GTr 


ACG 


TCG 


CCG 


GCG 


ATG 


TIG 


CTG 


GTG 



Software analysis 

All MegaBACE traces were analysed by in-house developed software and the 
resulting datasets of the ^^idin+O /Msel+0 individual fingerprints were processed by FPC 



P034 P206194EPBSW 

-17- 

software (Soderlund et aL ^nde supra) The JKwdin+2 /MpeI+3 AFLP fingerprints of the 
pooled BAG DNAs and the donoiparerits of the genetic mapping pppiilation were also 
analysed by in-house software. The link of the Hin6Bl+2 1 fragment to the . , 

jfiTmdin-K) / MseL^O fragment in the corresponding individual BAG fingerprint was made in 
silica by searching fragments of comparable length within a certain error margin. The 
contigs were generated with a cut-off parameter set at various values between 10'^ and lO'''* 
and the resulting contig evaluated. Tolerance was set at 2-4. ( i.e. 0.2-0.4 basepairs) , 

Results and conclusions 

Crenetic markers have been linked to BAG contigs in a very efQcient way and 
multiple genetic markers havis been cdmbined into ian integrated physical and genetic map. 
The concept is applicable to different species with varying genome sizes with success^ and 
without prior knowledge of the specific genomic sequence. The method provides for the 
effective integration of physical and genetic maps and will have an important impact in the 
genomic arena. 
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Claims 




A method for providing an integrated genetic and physical map of a genome 
or part thereof, flie method comprising the steps of: 

(a) providing at least two individual genetic markers for the genome or 
part thereof, preferably in the form of a genetic map; 

(b) characterising the genetic markers by means of at least one AFLP 
fragment identified through AFLP fingerprinting; 

(c) providing a library of clones comprising fragments of the genome or 
part thereof, preferably an artificial chromosome library such as a 
BACorYAC; 

(d) generating a multitude of pools, each pool containing a multitude of 
individual clones firom the library; 

(e) generating an AFLP fingerprint for each of the pools; 

(f) identifying in the multitude of pools a pool in which an AFLP 
fi-agment identified in (b) is present in the fingerprint of the pool; 

(g) generating an AFLP fingerprint for each of the individual clones in 
the pool identified in (f), and identifying the clone containing the 
AFLP fragment identified in (b) in its AFLP fingerprint; 

(h) generating a contig comprising the individual clone identified in step 

(g); 

(i) repeating steps (f-h) for at least a second AFLP fragment identified 
in (b) whereby the second or fiirth^ AFLP fragments characterise a 
second or fijrfhCT genetic marker; and, 

0) linking at least two of the contigs obtained in (Ji) to thereby obtain an 
integrated physical and genetic map of the genome or part thereof, 
which comprises at least two genetic markers; 
wherem the forward and reverse AFLP primers used in step (b) and (e) comprise K 
respectively L selective nucleotides at the 3'-end of the primers, wherein the forward 
and reverse AFLP primers used in step (g) comprise M respectively N selective 
nucleotides at the 3 '-end of the primers, wherein K, L, M, N are integers from 0 to 
10, and wherein K+L > M+N. 

A method for linking a genetic marker to a physical marker in a genome or 
part thereof, the method comprising the steps of: 

(a) characterising the genetic marker by means of at least one AFLP 
fragment identified through AFLP fingerprinting; 
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(b) providing a library of clones comprising fragments of the genome or 
part thereof, preferably an artificial chromosome library such as a 
BACorYAC; 

(c) generating a multitude of pools, each pool containing a multitude of 
individual clones from the library; 

_(^) . ... Sf^^^^Sj^ AFLP fingerprint for each of the pools; 

(e) identifying in the multitude of pools a pool in which an AFLP 
fragment identified in (a) is present in the fingerprint of the pool; 

(f) generating an AFLP fingerprint for each of the individual clones in 
the pool identified in (e), and identifying the clone containing the 
AFLP fragment identified in (a) in its AFLP fingerprint; 

(S)' ' generating a contig comprising the individual done identified in step 

(f), thereby linking the genetic marker to a physical marker; 
wherein the forward and reverse AFLP primers used in step (a) and (d) comprise K 
respectively L selective nucleotides at the 3*-end of the primer, wherein the forward and 
reverse AFLP primers used in step (f) comprise M respectively N selective nucleotides at 
the 3'-end of flie primer, wherein K, L, M, N are integers from 0 to 10, and wherein K+L > 
M+N. 



Method according to claim 2, wherein steps (a)-(g) are repeated for further 
genetic markers in the genome or part thereof and wherein the contigs obtained in 
(g) are aligned to thereby obtain an integrated physical and genetic map. 

Method according claims 1-3, wherein (K+L) - (M+N) is at least 1, 
preferably at least 2, more preferably at least 3, most preferably at least 4. 

Method according claims U4, wherein M+N is at least 0, preferably at least 
1, more preferably at least 2, most preferably at least 3, 

Method according claims 1-5, wherein each pool contains at most 0.6 
genome equivalent of the total genome to be analysed, preferably 0.5 more 
preferably 0.4, most preferably 0.3. 

Method according to claims 1-6, further comprising an additional pooling 

step . 



8. 



Method according to claims 1-7, wherein the genetic markers are provided 
with a density of at least one genetic marker per 100 kb. 



-20- 



9. Method according to claims 1-8, wherein the contigs are aligned using a 

computer program suitable for aligning such as FPC 

5 10. Method according to claims 1 -9, wherein the artificial chromosome library 

contains at least 5 genome equivalents. 

1 1 • Use of AFLP primers for the integration of genetic and physical maps. * 

10 12. Use of AFLP in linking genetic and physical genome maps. 

13 . Use of AFLP primers in Mnldng genetic and physical genome maps. 

14. Use of a first and a second pair of AFLP piimers in a method for linking 



genetic and physical genome maps whwein the first pair of AFLP primers coiriqprise 
K respectively L selective nucleotides at the 3'-end of tiie primer, wherein the 
second pair of AFLP primers used comprise M respectively N selective nucleotides 
at the 3 '-end of the primer, wherein K, L, M, N are integers from 0 to 1 0, and 
wherein K+L > M+N. 
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Abstract 

The invention pertains to a method for the integration of physical and genetic maps 
and markers. The method is based on the use of AFLP fingerprinting with primers of 
varying selectivity on a library of artificial chromosomes such as a BAC library. The 
fingerprinting is performed on the individual BACs and on the pools of BACs. Subsequent 
alignment o generates a contig and provides the integration of physical and genetic markers 
resulting in the integration of the physical and genetic map. 
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